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Abstract 

The mathematical study of the small- world concept has fostered quite some interest, 
showing that small-world features can be identified for some abstract classes of 
networks. However, passing to real complex systems, as for instance transportation 
networks, shows a number of new problems that make current analysis impossible. In 
this paper we show how a more refined kind of analysis, relying on transportation 
efficiency, can in fact be used to overcome such problems, and to give precious 
insights on the general characteristics of real transportation networks, eventually 
providing a picture where the small-world comes back as underlying construction 
principle. 
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The characterization of the structural properties of the underlying network 
is a very crucial issue to understand the function of a complex system [1]. 
For example, the structure of a social network affects spreading of informa- 
tion, fashions, rumors but also of epidemics over the network; the topological 
properties of a computer network (Internet, the World Wide Web) affect the 
efficiency of the communication. Only recently the accessibility of databases of 
real networks and the availability of powerful computers have made possible 
a series of empirical studies [2-6]. In [2] Watts and Strogatz have shown that 
the connection topology of some (social, biological and technological) networks 
is neither completely regular nor completely random [2]. Watts and Strogatz 
have named these networks, that are somehow in between regular and random 
networks, small worlds, in analogy with the small-world phenomenon observed 
in social systems [7]. The mathematical characterization of the small- world 
behavior is based on the evaluation of two quantities, the characteristic path 
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length L, measuring the typical separation between two generic nodes in the 
network and the clustering coefficient C, measuring the average cliquishness of 
a node. Small-world networks are in fact highly clustered, like regular lattices, 
yet having small characteristics path lengths, like random graphs. 

Although the initial small-world concept came from social networks, having a 
mathematical characterization makes it tempting to apply the same concept 
to any network representative of a complex system. This grand plan clashes 
with the fact that the mathematical formalism of [2] suffers from severe limi- 
tations: 1) it applies only to some cases, whereas in general the two quantities 
L and C are ill-defined; 2) it works only in the topological abstraction, where 
the only information retained is about the existence or the absence of a link, 
and nothing is known about the physical length of the link. 
In this paper we take as paradigmatic example of real complex systems the 
realm of transportation (and use the Boston public transportation system as 
real- world representative instance), showing how the passage from abstract 
social networks to applied complex systems present in nature poses new chal- 
lenges, that can in fact be overcome using a more general formalism developed 
in ref. [8] for weighted networks. 

The MBTA (Boston underground transportation system) consists of iV = 124 
stations and K = 124 tunnels (connecting couples of stations) extending 
throughout Boston and the other cities of the Massachusetts Bay [9]. This 
network can be considered as a graph with N nodes and K edges and is rep- 
resented by the adjacency (or connection) matrix {a^}, i.e. the N • N matrix 
whose entry is 1 if there is an edge joining node i to node j and otherwise, 
and by {%} the matrix of the spatial (geographical) distances between sta- 
tions. According to the formalism of ref. [2], valid for a subclass of unweighted 
(topological) networks, the information contained in {iij} is not used (as if 
£ij — 1 Mi ^ j) and the shortest path length between two generic vertices 
i and j is extracted by using only {a^}. The characteristic path length L is 
the average distance between two generic vertices: L = N ^_^ Y^i^j dij. The 
clustering coefficient C is a local property defined as follows. If the node i 
has ki neighbors, then at most kiiki — l)/2 edges can exist between them; Ci 
is the fraction of these edges that actually exist, and C is the average value 
C — Ci- If we apply this method to try to study the MBTA, we ob- 
tain L = 15.55 (an average of 15 steps, or 15 stations to connect 2 generic 
stations), while C is not well defined since there are few nodes nodes with 
only 1 neighbours, and then Ci = jj for these nodes. In any case to decide 
if the MBTA is a small world we have to compare the L and C obtained to 
the respective values for a random graph with the same N and K. When we 
consider a random graph we incur into the same problem for C; moreover we 
get L = oo because in most of the realizations of the random graph there are 
some nodes not connected to the remaining part of the network. Summing up, 
by mean of L and C we are unable to draw any conclusion. 
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Fig. 1. The network of the MBTA consists of N = 124 stations and K = 124 tunnels. 
The matrix {lij} has been calculated using databases from the MBTA [9] and the 
U.S. National Mapping Division. 

Now we propose our alternative formalism (based on ref. [8]), valid for weighted, 
and also disconnected networks. The matrix of the shortest path lengths {dij} 
is now calculated by using the information contained both in {a^} and in {lij}. 
Instead of L and C, the network is characterized in terms of how efficiently 
it propagates information on a global and on a local scale respectively. We 
assume that the efficiency in the communication between node i and j is 
inversely proportional to the shortest distance: = We see im- 

mediately that this way we avoid the problem of the divergence we had for L, 
in fact when there is no path in the graph between i and j, dij = +oo and 
consistently = 0. Moreover, the link characteristics (length/capacity in the 
case of transportation systems) are properly taken into account, and not flat- 
tened into their topological abstraction. We define the network efficiency as 
E = n(n-i) ^2i¥=o €i i = n(n-i) 1~- Q uan tity E is normalized to the 
efficiency of the ideal case in which the network has all the iV(iV — l)/2 possi- 
ble edges: in this way < E < 1 [8]. We call -E g i b the efficiency of the whole 
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Table 1 

Global and local efficiency and cost of the MBTA. In the first row the MBTA is 
considered. In the second raw the composite system MBTA+bus is considered. 





-Eglob 




Cost 


MBTA 


0.63 


0.03 


0.002 


MBTA+bus 


0.72 


0.46 


0.004 



network and E\ oc the average efficiency of the subgraph of the neighbors of a 
generic node i. In ref.[8] we have shown that E g \ ^ and E\ oc play respectively 
the role of L and C, and that small-world networks have both high E g \ oh and 
high E loc . 

Now, let us apply these new measures to the MBTA: the results are reported in 
tab.l. As we can see, the MBTA turns out to be a very efficient transportation 
system on a global scale but not at the local level. Let us analyze better what 
insights the calculation shows. In fact, -E g i b — 0.63 means that MBTA is only 
37% less efficient than the ideal subway with a direct tunnel from each station 
to the others, quite a remarkable result. On the other hand, E\ oc = 0.03 indi- 
cates a poor local efficiency: this shows that, differently from social systems, 
the MBTA is not fault tolerant and a damage in a station will dramatically af- 
fect the efficiency in the connection between the previous and the next station. 
In order to better understand the difference with respect to other systems that 
are globally but also locally efficient we need to consider the cost of a network. 
In general we expect the efficiency of a network to be higher when the number 
of edges increase. As a counterpart, in any real network there is a price to pay 
for number and length (weight) of edges. To quantify this effect we define the 
cost of a network as: Cost = a ij^ij/J2i=ij Uj- We have < Cost < 1, and 
the the maximum value 1 is obtained for the ideal case when all the edges 
are present in the network. Cost reduces to the normalized number of edges 
2K/N(N — 1) in the case of an unweighted graph. For the MBTA we get an 
extremely small value Cost = 0.002. This means that MBTA achieves the 63% 
of the efficiency of the ideal subway with a cost of only the 0.2%. Qualitatively 
similar results have been obtained for other underground systems. The price 
to pay for such low-cost high global efficiency is the lack of fault tolerance. 
This means that when we build a subway system, the priority is given to the 
achievement of global efficiency at a relatively low cost, and not to fault tol- 
erance. But where is the rationale for such a construction principle? In fact, 
fault tolerance in such a transportation system is less of a critical issue as it 
would seem: a temporary problem in a station can be solved in an economic 
way by other means, for example by taking a bus from the previous to the next 
station. That is to say, lack of fault tolerance for users is only apparent: the 
MBTA is not a closed system, as it can be considered, after all, a subgraph of 
a wider transportation network, and this explains why, fault tolerance is not a 
critical issue. Changing the MBTA network to take into account for example 
the bus systems, indeed, shows that this extended transportation system is 
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a small-world network (-Egiob = 0.72, E\ oc = 0.46)! Therefore, efficiency and 
fault-tolerance come back as a leading underlying construction principle, and 
the whole transportation system MBTA+bus turns out to be a small-world 
with a slight increase in the cost (Cost = 0.004). 

Summing up, the analysis of real-life complex systems like transportation net- 
works poses a number of new challenges, that make the initial mathemati- 
cal formalization of small worlds in ref. [2] fail. The introduction of the effi- 
ciency measure allows to give a more general mathematical definition of small 
worlds, able to deal successfully with transportation systems (and in general, 
for weighted networks). Such measure, like in the MBTA case, provides quan- 
titative information on the efficiency characteristics of a system, helping to 
explain the underlying construction principles. Moreover, apparent lack of a 
generalized small-world behaviour can, as in the MBTA case, be explained 
by the fact we have just a partial view of the complete system. In fact, the 
analysis presented in this paper shows that a generic closed transportation 
system can exhibit the small- world behavior, substantiating the idea that, in 
the grand picture, the diffusion of small-world networks can be interpreted as 
the need to create networks that are both globally and locally efficient. 
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